Subtopic segmentation in the lecture speech
Authors
Abstract
This paper proposes a method for segmenting lecture video material into subtopics based on speech signals, to support the creation of educational video content. To represent the subtopic of each video segment, the text recognized from the lecture speech by automatic speech recognition (ASR) is converted into an index using independent component analysis (ICA) instead of conventional TF-IDF. Segmentation is performed by dynamic programming that minimizes the sum of cosine measures between adjacent indexes. The validity of the proposed method was evaluated using sample lecture videos. The results indicate that subtopic segmentation using ASR output performed as well as segmentation using transcription text.
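The dynamic-programming step can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not state: each utterance already has an index vector (for example, ICA component weights), the index of a segment is the sum of its utterance vectors, and the number of segments `k` is given. The names `cosine` and `segment` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine measure between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return 0.0 if nu == 0 or nv == 0 else dot / (nu * nv)


def segment(vectors, k):
    """Split `vectors` into k contiguous segments, minimizing the sum of
    cosine measures between adjacent segment indexes.

    Assumption: a segment's index is the sum of its utterance vectors.
    Returns (segment start positions, total cost).
    """
    n = len(vectors)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of vectors")
    dim = len(vectors[0])

    # Prefix sums so that any segment's summed vector is one subtraction.
    prefix = [[0.0] * dim]
    for v in vectors:
        prefix.append([p + x for p, x in zip(prefix[-1], v)])

    def seg_vec(i, j):
        return [b - a for a, b in zip(prefix[i], prefix[j])]

    # best[(i, j)]: minimum cost of a partial segmentation whose last
    # segment covers positions i..j-1. With one segment the cost is 0.
    best = {(0, j): 0.0 for j in range(1, n + 1)}
    back = []  # one back-pointer table per added segment
    for _ in range(2, k + 1):
        new, pointers = {}, {}
        for (h, i), cost in best.items():
            left = seg_vec(h, i)
            for j in range(i + 1, n + 1):
                c = cost + cosine(left, seg_vec(i, j))
                if (i, j) not in new or c < new[(i, j)]:
                    new[(i, j)] = c
                    pointers[(i, j)] = (h, i)
        back.append(pointers)
        best = new

    # The last segment must end at n; trace the boundaries backwards.
    state = min((s for s in best if s[1] == n), key=lambda s: best[s])
    total = best[state]
    starts = [state[0]]
    for pointers in reversed(back):
        state = pointers[state]
        starts.append(state[0])
    return starts[::-1], total


# Toy data: three utterances on one topic, then two on another.
toy = [[1, 0], [1, 0], [1, 0], [0, 1], [0, 1]]
starts, total = segment(toy, 2)
```

The search keeps the full last segment in the state because the cost of adding a boundary depends on both neighbouring segments. That makes the sketch cubic in the number of utterances for each added segment, which is acceptable for a single lecture but not tuned for long recordings.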
Similar resources
Lecture subtopic retrieval by retrieval keyword expansion using subordinate concept
We developed a support system for the creation of educational video content. The system automatically segments lecture video material into subtopics based on speech signals, using a statistical model for text segmentation. In this paper, we report on the results of retrieving lecture subtopics by keyword expansion using dictionary knowledge, and so on. The keyword expansion using t...
Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition
In this paper, we propose a method for segmenting continuous lecture speech into topics. A lecture includes several topics, but it is difficult to judge their boundaries. To solve this problem, transcriptions obtained by spontaneous speech recognition of the lecture speech are associated with the textbook used in the lecture. This method showed high performance of the topic segmentation with an av...
Subtopic annotation and automatic segmentation for news texts in Brazilian Portuguese
Subtopic segmentation aims to break documents into subtopical text passages, which develop a main topic in a text. Being capable of automatically detecting subtopics is very useful for several Natural Language Processing applications. For instance, in automatic summarisation, having the subtopics at hand enables the production of summaries with good subtopic coverage. Given the usefulness of su...
TextTiling: Segmenting Text into Multi-paragraph Subtopic Passages
TextTiling is a technique for subdividing texts into multi-paragraph units that represent passages, or subtopics. The discourse cues for identifying major subtopic shifts are patterns of lexical co-occurrence and distribution. The algorithm is fully implemented and is shown to produce segmentation that corresponds well to human judgments of the subtopic boundaries of 12 texts. Multi-paragraph s...
Text Segmentation Using Reiteration and Collocation
A method is presented for segmenting text into subtopic areas. The proportion of related pairwise words is calculated between adjacent windows of text to determine their lexical similarity. The lexical cohesion relations of reiteration and collocation are used to identify related words. These relations are automatically located using a combination of three linguistic features: word repetition, ...
Publication date: 2004